RANK ONE PERTURBATIONS AND THE ZEROS OF 
PARAORTHOGONAL POLYNOMIALS ON THE UNIT 

CIRCLE 



BARRY SIMON* 

Abstract. We prove several results about zeros of paraorthog- 
onal polynomials using the theory of rank one perturbations of 
unitary operators. In particular, we obtain new details on the 
interlacing of zeros for successive POPUC. 



1. Introduction 

This note concerns an aspect of the theory of orthogonal polynomials 
on the unit circle (OPUC); for background, see [TTl HI [71 |H1 El • Given a 
nontrivial probability measure, dfi, on = {z e C | 1^1 = 1}, we let 
$„(z) (we use ^n{z,dfi) when dfi needs to be exphcit) be the monic 
orthogonal polynomials. They obey the Szego recursion relations 

<^n+l{z) = Z<l>n{z) - anKi^) (1-1) 
K{^)=Z-^M^) (1.2) 

where {q;„}5J^q G D°° are the Verblunsky coefficients, dfi ^ {«n}$^o 
sets up a one-one correspondence between D°° and nontrivial probabil- 
ity measures (Verblunsky's theorem). 

Given /3 G 9D, the paraorthogonal polynomials (POPUC) are de- 
fined by {Note: |7j uses /3 where fjl.3|) uses —f3; ()1.3|) is the right con- 
vention.) 

<l>n{z, dfi] (3) = z<^n-i{z, dfi) - /3$;_i(z, dfi) (1.3) 
More generally, we will consider a sequence {[3n}'^=i G and 

^n{z) = ^n{z,dll-(3n) (1.4) 

POPUC were introduced at least as early as Jones, Njastad, and 
Thron p. About five years ago, Cantero, Moral, and Velazquez [T] 
and Golinskii [S] realized that zeros of POPUC shared many properties 
of zeros of OPRL and independently proved a number of basic results 
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about these zeros. Cantero et al. P recently proved additional results. 
The basic tool in ^1 |3] is the Christoffel-Darboux formula; j3] also 
exploits the CMV matrix. Our goal in this paper is to use the theory 
of rank one perturbations of unitary matrices to recover many of the 
basic results about zeros of POPUC and prove some new results. In 
particular, we will illuminate the issue of interlacing of the zeros of 
successive POPUC. 

First, some notation. Given distinct z,w E dU), {z,w) is the set of 
points, (, in dlD) with 

Arg(2;) < Arg(C) < Arg(w) (1.5) 

where a branch of Arg is chosen so < Arg(?i;) — Arg(z) < 27r. An 
ordered set of points {zi, . . . ,ze) G is called cyclicly ordered if each 
{zj, Zj+iYj^i and {zi, Zi) contain no other Zj^s. The ordering is fixed by 
such cyclicity up to a single choice. We will always assume zeros of 
POPUC are cyclicly ordered. 

Two cyclicly ordered sets {zi, . . . , Zi) and {wi, . . . , Wi) in dV>^ are 
said to strictly interlace if after a cyclic permutation of the w's, we 
have Wj G {zj, -Zj+i), j = 1,2, . . . , i — 1, we E {ze, zi). This, of course, 
implies Zj G (u^j-i, Wj), j = 2,3, . . . ,i and Zi G {w£, Wi). 

For {aj}'jLQ, the second kind polynomials, \E'„(z,(i/i) are defined, as 
usual, to be the $„'s associated to aj = —aj{dfi). We define 

■^niz, dfi; /3) = z^n_i{z, dfi) - P^*^_iiz, dfi) (1.6) 

We can now state our main results: 

Theorem 1.1 (PHHI)- If{wQ,Wi) is an interval disjoint from supp{dn) , 
then for any choice of {3 and any n, $„(2, d^; (3) has at most one zero 
m {wo,Wi). 

The following has also been proven by Wong [T^ : 

Theorem 1.2. Let {zi, . . . , Zn) be the zeros of some ^n{z,dfi; (3) and 
{wi,...,Wn) of '^n{.z,dii;—(3). {Note: —[3, not p.) Then these zeros 
strictly interlace. 

Theorem 1.3 (jUEl)- Fix d/j, and n and distinct f3,f3' in d3. Then 
the zeros of ^n{z,dfj,] P) and ^n{z,dfi; P') strictly interlace. 

The power of our approach is shown by the refined version we obtain 
relating zeros of and $„. We will need the following computed 
sequence in 5D: 

Xn = Pn^lPn(^^^^) (1-7) 
VPnftn - 1/ 
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Theorem 1.4. For each n, one of two possibilities holds: 

(i) $„ and have no zeros in common. In that case, A„ is not a 
zero of either, and {zeros o/$„}U{A„} strictly interlace {zeros of 

(ii) and have a single zero in common. In that case, A„ is that 
zero and {zeros o/$„} strictly interlace {zeros 0/ $n+i}\{A„}. 

Corollary 1.5. // $1, $2, ^"3; • • • have a common zero at A, then (3n 
are given inductively by 

A = A (1.8) 

/3n+l = A^nf§^^^') (1.9) 

V/?„a„ - 1/ 

Example 1.6. a = 0. Then = A" and $„(-2) = z^ — \^ precisely 
the POPs with a zero at A for all n. □ 

The key to our proofs is the connection of $„ to CMV matrices 
HI . is the determinant of a suitable finite CMV matrix, and 
so its zeros are the eigenvalues. All our results concern what happens to 
eigenvalues of unitary matrices under rank one perturbations. Section|21 
discusses general rank one perturbations, and Section |21 the application 
to POPUC. 

It is a pleasure to thank Mara Jose Cantero and Lilian Wong for 
useful discussions. 



2. Rank One Perturbations 

Rank one perturbations of unitaries are discussed in Sections 1.3.9, 
1.4.16, 3.2, 4.5, and 10.16 of [71 [H], and some of the results in this 
section are spread through that material. 

If U and V are two unitaries on a finite- or infinite-dimensional 
Hilbert space and U — V is rank one, we pick a unit vector ip G 
ker(f/ — V^)"*" and note there must be a A G dV> with 

Vip = XUip (2.1) 

and thus 

V -U = {X-l){^, ■)U^ (2.2) 
It is convenient to define for z G © and A unitary: 

^A,^W = (^, (2.3) 
fAA^) = z-\l-F{z)){l + F{z))-' (2.4) 
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F is a Caratheodory function (Re F{z) > on D, -F(O) = 1) and / 
a Schur function (1/(^)1 < 1 on D). The spectral measure for A,ip is 
given by 

Fa,^{z)= / .g _ dnA,^{z) (2.5) 

J € Z 

It is not hard to see that 

Proposition 2.1. Let ip he a cyclic vector for a unitary A {i.e., 
{y4^(y9}^_^ is total). Au interval {wq^Wi) in is disjoint from 
(Tess(^) if and only if f has an analytic continuation through {wq^Wi) 
with \f{z)\ = 1 on that interval. In that case, 

(a) Arg(/) is strictly monotone increasing on {wo,Wi). 

(b) The only spectra of A on (wq, Wi) are simple eigenvalues precisely 
at the points z where 

zf{z) = 1 (2.6) 

Arg(/) is increasing since \f{z)\ < 1 in © and \f{z) \ = 1 on {wo,wi) 
imphes d\f{re^^)\/dr > on {wo,wi). So by the Cauchy-Riemann 
equations, 9Arg(/(e*^))/(96' > 0. (b) holds since 

^/ N 1 + Zf(z) 

has poles at points where ()2.6p holds. 

When ()2.2|) holds, a direct calculation (see (1.4.90) and the end of 
Section 3.2 in '71) shows that 



Proposition 2.2. // (Q holds, then 

fvA^) = ^~'fuA^) (2.8) 

We immediately have 

Theorem 2.3. Let (j22I) hold. If {wo,wi) (1 a{U) = 0, then V has at 
most one eigenvalue in [wq,Wi] and no other spectrum there. 

Proof. Let JC be the cyclic subspace for U and (p. Since U = V on 
/C"*- which is invariant for both, we can suppose p is cyclic. In that 
case, picking zo G {wo,Wi) and then ATg{z{f{z))) so ATg{zofo{zo)) G 
(0,27r), we see Arg(z(/(z)) G (0,27r) on all of {wq,Wi) since fl2.6|) has 
no solution there. By the strict monotonicity of Arg(/), zf{z) = A has 
at most one solution in [wq, wi], so by Propositions 12 . 1 1 and 12 . 21 V that 
at most one eigenvalue there. □ 

Proposition 2.4. Let U, V be unitaries on C" so ()2.2j) holds for A ^ 
1 and for ip cyclic for U. Then the eigenvalues of U and V strictly 
interlace. 
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Proof. Since U has a cyclic vector, its spectrum is simple so zf{z) = 1 
has n solutions. Since ATg{z{f)) is strictly monotone, zf{z) = X has n 
solutions which interlace the solutions of zf{z) = l. □ 

One can say something about the case where (p is not cyclic. 

Proposition 2.5. Let U, V be unitaries on C" so fj2.2j) holds. Let 
Zq, Z\ he two eigenvalues of U. Then V has an eigenvalue in [zq, zi] 
(= izo,Zi) U {Zo,Zi}). 

Proof. Let /C be the cyclic subspace of {U, ip) which is invariant for U. 
If zq and zi are eigenvalues ofU \ JC, V has an eigenvalue in {zq, zi) by 
Proposition 12.41 If not, since U \ JC^ = V \ JC^, either zq or zi is an 
eigenvalue of V. □ 

Finally, we have a specialized result that is precisely what we need 
to prove Theorem 11.41 

Proposition 2.6. LetU = Ui®U2 on K,i®K,2, two finite- dimensional 
subspaces ofH, a space of dimension n. Let ipj (j = 1,2) be cyclic 
vectors for Uj on Kj. Let (f = a(fi © b(f2 where {a,b) ^ (0,0) and 
|a|2 + |6p = 1. Let V be given by with \ ^ I. If Ui and U2 

have i eigenvalues in common, then V has these i common values as 
eigenvalues and its other n — i eigenvalues strictly interlace those ofU. 

Proof. Since (pj is cyclic for Uj, any simple eigenvalue of U is in the 
cyclic subspace generated by f/, (p. Moreover, any common eigenvalue is 
a simple eigenvalue for [/ f /C where /C = cyclic subspace of p. Thus U \ 
/C has all the eigenvalues of U but with multiplicity 1. The eigenvalues 
oi V \ K, strictly interlace by Proposition 12.41 The eigenvalues of 
V \ K,^ = U \ /C"*" are exactly the common eigenvalues. □ 

Remark, ip is cyclic if and only if £ = 0. 

3. Zeros of POPUC and Finite CMV Matrices 

Given a sequence {7n}$^o elements in D, one defines the CMV 
matrix C{{jn}n=o) f by 

C = CM (3.1) 

/: = e(7o)©e(72)©--- (3.2) 

^ = lixi©e(7i)©0(73)©--- (3.3) 
where lixi is the one-dimensional identity matrix, and is given by 

0(7)= _;) (3.4) 
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r=(l-|7nV2 (3.5) 

It is a fundamental result of Cantero, Moral, and Velazquez j2j (see 
also [3 Section 4.2]) and see (TU] for other references) that if dfi is a 
nontrivial probability measure on 83, Xn is the basis of L^((9©, dfi) ob- 
tained by applying Gram-Schmidt to 1 ^, . . . , and an{dfi) 
are the Verblunsky coefficients of dfi, then C({a„((i/i)}J^Q) is the ma- 
trix of multiplication by z on L^(9D, dfi) in Xn basis. Note in this case 
that 7n € D (rather than some 7„ G 9D), in which case we call C a 
proper CMV matrix. 

If I7I = 1, then r = 0, and 6(7) is a direct sum of two 1x1 matrices, 
and so, if |7n-i| = 1, C breaks into a direct sum of an n x n matrix and 
an infinite piece. The finite piece, C„({7o, . . . ,7„_i}), is called a finite 
CMV matrix. It is not hard to show that (see, e.g., [TU]): 

Proposition 3.1. 7/7^ G © for all n, then 60 = (1,0,... )* is a cyclic 
vector for C{{'yn}n=o)- U lo, ■ ■ ■ ,7n-2 e ©, 7^-1 G OB, then 5o is a 
cyclic vector for Cn{{'ym}^Jo) ■ 

Moreover (see [71 Section 4.2]), 

Proposition 3.2. If ao, ■ ■ ■ , an-2 G D and (3 = a„_i G dlD), then 

$„(^, /5) = det(^ - Cni{a,}]Z^)) (3.6) 

In particular, the zeros of $„ are the eigenvalues of a finite CMV ma- 
trix. 

Finally, we need the following, which generalizes Lemma 4.5.1 of 7j: 
Lemma 3.3. Leta eB and j3 G 9D. Then 

e(a) - {'I (3.7) 



^ = /3|l^'| (3-8) 



is rank one if and only if 



1 

Proof. A 2 X 2 matrix is rank one if and only if det(A) = 0. Since 

det(^e(a)- (^^ =(a-/5)(-a-x)-(l-|ap) (3.9) 

we see fl3.7j) is rank one if and only if RHS of ()3.9j) = 0, which is solved 
by dSH). □ 

Note: \x\ = 1, so (g °) is unitary. 
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Proof of Theorem M.ll Let C be the CMV matrix, C({a„((i/x)}^Q), of 
dfi. Given n and /?, pick x so 6(a„_i) — ( q °. ) is rank one, and let C 
be the matrix obtained from C by replacing 0(a„_i) by ( g °. ) • Then C 
is unitary (by the note after Lemma f3.3j) and C — C is rank one. Thus, 
by Theorem 12 .3^ C has at most one eigenvalue in {wq,wi). But C is a 
direct sum of C„({ao, . . . ,an-2,P}) and another matrix, so C„ has at 
most one eigenvalue in {wo,Wi). By Proposition 13.21 zeros of are 
eigenvalues of C„. □ 

Proof of Theorem, \l.i\ Let = [3. By Theorem 5.2 of jUII (see 

also [7J Theorem 4.2.9]), C„({— am}m=o) unitarily equivalent to = 
^{{c(m}^2o)M{{am}^2o) where M differs from M by having -lixi 
in place of lixi- Thus, C„({am}m=o) ~C is rank one, and Theorem 11.21 
follows from Propositions 12.41 and 13. II □ 

Proof of Theorem \l.r\ If 

aj = a'j = aj{dfi) j = 0, . . . ,n — 2 (3.10) 

an-i = (3 aLi = 13' (3.11) 

then C{{am}Z7=o) ~^({'^m}m=o) obviously rank one. Moreover, 5n-i 
is a cyclic vector since Cn run backwards is essentially another C„ (with 
the initial lixi replaced by q;„_i1ixi) or C* . Thus, Theorem II . 31 follows 
from Proposition 12.41 □ 

Proof of Theorem \1.4\ Let Cn+i (resp. C„) be the (n + 1) x (n + 1) 
(resp. nxn) finite CMV matrix whose characteristic polynomial is 
(resp. $„). By Lemma l3.3| a rank one perturbation turns Cn+i into 
Cn © A^lixi where An is given by ()1.7p . The vector in the perturbation 
is a5„„i + b6n, so Proposition 12.61 applies and proves Theorem 11.41 □ 

As a final result: 

Theorem 3.4. Let m > n. Then strictly between any pair of zeros of 
<l>n is a zero of^m- 

Proof. Let Cn,Cm be as in the last proof. By a rank one perturbation. 
Cm can be changed to C„ © Qm-n- Now apply Proposition 12.61 □ 
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